Supporting Multilingual Information Retrieval in Web Applications: An English-Chinese Web Portal Experiment

Authors

  • Jialun Qin
  • Yilu Zhou
  • Michael Chau
  • Hsinchun Chen
Abstract

Cross-language information retrieval (CLIR) and multilingual information retrieval (MLIR) techniques have been widely studied, but they are seldom applied to or evaluated in Web applications. In this paper, we present our research in developing and evaluating a multilingual English-Chinese Web portal in the business domain. We adopted a dictionary-based approach that combines phrasal translation, co-occurrence analysis, and pre- and post-translation query expansion. The approach was evaluated by domain experts, and the results showed that co-occurrence-based phrasal translation achieved a 74.6% improvement in precision compared with simple word-by-word translation.
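The abstract does not give implementation details, so the following is only a minimal sketch of how dictionary-based phrasal translation with co-occurrence disambiguation could work in general. The dictionary entries, the co-occurrence counts, and the function names (`segment`, `score`, `translate`) are all hypothetical and are not taken from the paper. Query expansion is not shown.

```python
from itertools import product

# Hypothetical toy bilingual dictionary: English term -> candidate Chinese translations.
DICTIONARY = {
    "bank": ["银行", "河岸"],
    "interest": ["利息", "兴趣"],
    "interest rate": ["利率"],
}

# Hypothetical counts of how often two Chinese terms co-occur in a target-language corpus.
COOCCURRENCE = {
    frozenset(["银行", "利息"]): 120,
    frozenset(["银行", "兴趣"]): 4,
    frozenset(["河岸", "利息"]): 1,
    frozenset(["河岸", "兴趣"]): 2,
    frozenset(["银行", "利率"]): 150,
    frozenset(["河岸", "利率"]): 1,
}


def segment(query, dictionary, max_len=3):
    """Greedy longest-match segmentation, so dictionary phrases win over single words."""
    words = query.lower().split()
    units, i = [], 0
    while i < len(words):
        for n in range(min(max_len, len(words) - i), 0, -1):
            candidate = " ".join(words[i:i + n])
            # Accept a dictionary phrase, or fall back to the single word.
            if candidate in dictionary or n == 1:
                units.append(candidate)
                i += n
                break
    return units


def score(translations, cooccurrence):
    """Sum pairwise co-occurrence counts over one combination of translations."""
    total = 0
    for i in range(len(translations)):
        for j in range(i + 1, len(translations)):
            pair = frozenset([translations[i], translations[j]])
            total += cooccurrence.get(pair, 0)
    return total


def translate(query, dictionary=DICTIONARY, cooccurrence=COOCCURRENCE):
    """Pick the combination of candidate translations that co-occurs most often."""
    units = segment(query, dictionary)
    # Terms missing from the dictionary are passed through untranslated.
    candidates = [dictionary.get(unit, [unit]) for unit in units]
    best = max(product(*candidates), key=lambda combo: score(combo, cooccurrence))
    return list(best)


print(translate("bank interest"))
print(translate("bank interest rate"))
```

In this sketch, a word-by-word baseline would simply take the first dictionary entry for each word. The co-occurrence step instead compares every combination of candidates, which is feasible for short queries but grows quickly with query length.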

Similar resources

Supporting Multilingual Internet Searching and Browsing

The amount of non-English information has proliferated rapidly in recent years. The broad diversity of the multilingual content presents a substantial research challenge in the field of knowledge discovery and information retrieval. Therefore there is an increased interest in the development of multilingual systems to support information sharing across languages. The goal of this dissertation i...

Integrating Query Translation and Document Translation in a Cross-language Information Retrieval System

Due to the explosive growth of the WWW, very large multilingual textual resources have motivated research in Cross-Language Information Retrieval and online Web Machine Translation. In this paper, the integration of language translation and text processing systems is proposed to build a multilingual information system. A distributed English-Chinese system on the WWW is introduced to illustrate...

CMedPort: An integrated approach to facilitating Chinese medical information seeking

As the number of non-English resources available on the Web is increasing rapidly, developing information retrieval techniques for non-English languages is becoming an urgent and challenging issue. In this research to facilitate information seeking in a multilingual world, we focused on discovering how search-engine techniques developed for English could be generalized for use with other langua...

Multilingual Information Retrieval in World Wide Web

The article addresses: (1) the design of an information retrieval (IR) toolkit, the Multilingual Information Retrieval Tool Hierarchy (MIRTH), which works with virtual corpora on the World Wide Web, also known as the Web or WWW. It is motivated by the desire to create a search engine to retrieve information by accessing a virtual corpus; (2) the implementation of a general model of multilingual retrieval for the W...

A Multilingual Information Retrieval Tool Hierarchy for a WWW "Virtual Corpus"

The article addresses: 1. the design of an information retrieval (IR) toolkit, named as the Multilingual Information Retrieval Tool Hierarchy (MIRTH) search engine, which works with virtual corpora on the World Wide Web, also known as the Web or WWW for short. It is motivated by the desire to create a multilingual search engine to retrieve information by accessing a virtual corpus; 2. the imple...

Publication date: 2003